Patterns of segmental duplication in the human genome.
Abstract
We analyzed the completed human genome for recent segmental duplications (size ≥ 1 kb and sequence similarity ≥ 90%). We found that approximately 4% of the genome is covered by duplications and that the extent of segmental duplication varies from 1% to 14% among the 24 chromosomes. Intrachromosomal duplication is more frequent than interchromosomal duplication in 15 chromosomes. The duplication frequencies in pericentromeric and subtelomeric regions are greater than the genome average by approximately threefold and fourfold, respectively. We examined factors that may affect the frequency of duplication in a region. Within individual chromosomes, the duplication frequency shows little correlation with local gene density, repeat density, recombination rate, or GC content, except on chromosomes 7 and Y. Across the entire genome, however, the duplication frequency is correlated with each of these factors. Based on known genes and Ensembl genes, the proportion of duplications containing complete genes is 3.4% and 10.7%, respectively. The proportion of duplications containing genes is higher in intrachromosomal than in interchromosomal duplications, and duplications containing genes have a higher sequence similarity and tend to be longer than duplications containing no genes. Our simulation suggests that many duplications containing genes have been selectively maintained in the genome.
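The abstract's thresholds (size ≥ 1 kb, sequence similarity ≥ 90%) and its per-chromosome coverage figures can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The input layout (tuples of chromosome, start, end, identity, with half-open coordinates) and all function names are assumptions made for illustration only.

```python
from collections import defaultdict

# Thresholds taken from the abstract; everything else here is hypothetical.
MIN_LENGTH_BP = 1_000   # size >= 1 kb
MIN_IDENTITY = 0.90     # sequence similarity >= 90%


def is_recent_duplication(start, end, identity):
    """Return True if an aligned segment meets both abstract thresholds."""
    return (end - start) >= MIN_LENGTH_BP and identity >= MIN_IDENTITY


def merge_intervals(intervals):
    """Merge overlapping or touching half-open intervals [start, end)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def duplication_coverage(alignments, chrom_lengths):
    """Fraction of each chromosome covered by qualifying duplicated segments.

    alignments: iterable of (chrom, start, end, identity) tuples (assumed layout).
    chrom_lengths: dict mapping chromosome name to its length in bp.
    """
    per_chrom = defaultdict(list)
    for chrom, start, end, identity in alignments:
        if is_recent_duplication(start, end, identity):
            per_chrom[chrom].append((start, end))

    coverage = {}
    for chrom, length in chrom_lengths.items():
        # Merge first so overlapping duplications are not counted twice.
        covered = sum(e - s for s, e in merge_intervals(per_chrom[chrom]))
        coverage[chrom] = covered / length
    return coverage
```

Merging intervals before summing matters because a base that lies in several duplicated segments should count only once toward the covered fraction.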
Similar resources
A Parsimony Approach to Analysis of Human Segmental Duplications
Segmental duplications are abundant in the human genome, but their evolutionary history is not well understood. The mystery surrounding them is due in part to their complex organization; many segmental duplications are mosaic patterns of smaller repeated segments, or duplicons. A two-step model of duplication has been proposed to explain these mosaic patterns. In this model, duplicons are copie...
Relationship between Chromosome Rearrangement and Repeat Sequences in Human Chromosome 7
Various types of repeat patterns are abundant in genomic sequences and are associated with biological phenomena at distinct levels. In particular, comparative analyses of whole-genome-sized sequence data reveal that periodic sequences cause segmental duplication, which is a type of chromosomal structural arrangement [2]. In this study, we analyze the relationships between the large-...
Parsimony and likelihood reconstruction of human segmental duplications
MOTIVATION: Segmental duplications > 1 kb in length with ≥ 90% sequence identity between copies comprise nearly 5% of the human genome. They are frequently found in large, contiguous regions known as duplication blocks that can contain mosaic patterns of thousands of segmental duplications. Reconstructing the evolutionary history of these complex genomic regions is a non-trivial, but importan...
Evolution of Arabidopsis microRNA families through duplication events.
Recently there has been great interest in the identification of microRNAs and their targets, as well as in understanding the spatial and temporal regulation of microRNA genes. To understand how microRNA genes evolve, we looked at several rapidly evolving families in Arabidopsis thaliana, and found that they arose from a process of genome-wide duplication, tandem duplication, and segmental duplica...
Analysis of segmental duplications via duplication distance
MOTIVATION: Segmental duplications are common in mammalian genomes, but their evolutionary origins remain mysterious. A major difficulty in analyzing segmental duplications is that many duplications are complex mosaics of fragments of numerous other segmental duplications. RESULTS: We introduce a novel measure called duplication distance that describes the minimum number of duplications necessa...
Journal title:
- Molecular Biology and Evolution
Volume 22, Issue 1
Pages -
Publication date: 2005